Selectivity estimation of Window queries for line segment datasets
نویسندگان
چکیده
Despite of the fact that large line segment datasets are becoming more and more popular, most of the analysis for estimating the selectivity of window queries posed on spatial data {the most important parameter for query optimization{ has focused on point or region data only. In this paper we move one signiicant step forward in line segment datasets theoretical analysis. We discovered that real lines closely follow a distribution law, that we named the SLED law (Segment LEngth Distribution). The SLED law can be used for an accurate estimation of the selectivity of window queries. Experiments on a variety of real line segment datasets (hydrographic systems, roadmaps, railroads, utilities networks) show that our law holds and that our formula is extremely accurate, enjoying a maximum relative error of 4% in estimating the selectivity.
منابع مشابه
Selectivity Estimation of Window Queries for Line Segment Datasets on Leave from Dipartimento Di Matematica
Despite of the fact that large line segment datasets are appearing more and more frequently in numerous applications involving spatial data, such as GIS 8, 9] multime-dia 6] and even traditional databases, most of the analysis for estimating the selectivity of window queries posed on spatial data {the most important parameter for query optimization{ has focused on point or region data only. In ...
متن کاملSelectivity Estimation of Window Queriesfor Line Segment
Despite of the fact that large line segment datasets are appearing more and more frequently in numerous applications involving spatial data, such as GIS 8, 9] multime-dia 6] and even traditional databases, most of the analysis for estimating the selectivity of window queries posed on spatial data {the most important parameter for query optimization{ has focused on point or region data only. In ...
متن کاملSelectivity Estimation for Predictive Spatio-Temporal Queries
This paper proposes a cost model for selectivity estimation of predictive spatio-temporal window queries. Initially, we focus on uniform data proposing formulae that capture both points and rectangles, and any type of object/query mobility combination (i.e., dynamic objects, dynamic queries or both). Then, we apply the model to non-uniform datasets by introducing spatio-temporal histograms, whi...
متن کاملAnalyzing Range Queries on Spatial Data
Analysis of range queries on spatial (multidimensional) data is both important and challenging. Most previous analysis attempts have made certain simplifying assumptions about the datasets and/or queries to keep the analysis tractable. As a result, they may not be universally applicable. This paper proposes a set of five analysis techniques to estimate the selectivity and number of index nodes ...
متن کاملSelectivity Estimation of High Dimensional Window Queries via Clustering
Query optimization is an important functionality of modern database systems and often based on estimating the selectivity of queries before actually executing them. Well-known techniques for estimating the result set size of a query are sampling and histogram-based solutions. Sampling-based approaches heavily depend on the size of the drawn sample which causes a trade-off between the quality of...
متن کامل